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Abstract 

By employing exhaustive lists of large firms in European countries, we show that 
the upper-tail of the distribution of firm size can be fitted with a power-law (Pareto- 
Zipf law), and that in this region the growth rate of each firm is independent of the 
firm's size (Gibrat's law of proportionate effect). We also find that detailed balance 
holds in the large-size region for periods we investigated; the empirical probability 
for a firm to change its size from a value to another is statistically the same as 
that for its reverse process. We prove several relationships among Pareto-Zipf's law, 
Gibrat's law and the condition of detailed balance. As a consequence, we show that 
the distribution of growth rate possesses a non-trivial relation between the positive 
side of the distribution and the negative side, through the value of Pareto index, as 
is confirmed empirically. 

Key words: Pareto-Zipf law, Gibrat law, firm growth, detailed balance, 
Econophysics 
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1 Introduction 

Pareto [1] is generally credited with the discovery, more than a century ago, 
that the distribution of personal income obeys a power-law in high-income 
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range 2 . Firm size also has a skew distribution [3], and quite often obeys a 
power-law in the upper tail of the distribution. In terms of cumulative distri- 
bution P>(x) for firm size x, this states that 

P>(x) oc aT", (1) 



for large x, with /x being a parameter called Pareto index. The special case 
\x = 1 is often referred to as Zipf's law [4]. In this paper we call it Pareto-Zipf 
law, the fact that firm size has a power-law distribution asymptotically for 
large firms. 

Even if the range for which eq. (1) is valid is a few percent in the upper tail of 
the distribution, it is often observed that such a small fraction of firms occupies 
a large amount of total sum of firm sizes. This means that a small idiosyncratic 
shock can make a considerable macro-economic impact. It is, therefore, quite 
important to ask what is the underlying dynamics that governs the growth of 
those large firms. 

Let a firm's size be x\ at a time and x 2 at a later time. Growth rate R is 
defined as the ratio R = xijx\. Law of proportionate effect [5] (see also [6]) is 
a postulate that the growth rate of a firm is independent of the firm's attained 
size, i.e. 

P{R\x\) is independent of x±, (2) 



where P(R\x) is the probability distribution of growth rate conditional on the 
initial size X\. In this paper we call this assumption as Gibrat's law 3 . 

These two laws have been extensively studied in industrial organization and 
related stochastic models [3,6,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24] 
(see [25] for review). Recent study in econophysics [26,27,28,29,30,31,32,33,34,35,36,37] 
introduced some notions and concepts of statistical physics into economics (see 
[38]). Present status related to firm-size growth may be summarized as follows. 
Firm size distribution is approximately log-normal with deviation from it in 
the upper tail of the distribution (e.g. [24] for recent data). On the other hand, 
Gibrat's law breaks down in the sense that the fluctuations of growth rate scale 
as a power-law with firm size; smaller firms can possibly have larger fluctu- 
ations (e.g. [27,28]). However, little attention has been paid to the regime 
of firm size where power-law is dominant rather than log-normality, and to 
the validity of Gibrat's law in that regime. More importantly, any kinematic 

2 See [2] for modern and high-quality personal-income data in Japan. 

3 Another interesting and related quantity is flow, e.g. profits, rather than stock. 
See [7] for growth of individual personal-income and [8] for firms tax-income growth, 
and validity of Gibrat's law. 
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relationship between Pareto-Zipf and Gibrat laws has not been understood 
explicitly, although there have been a lot of works on stochastic dynamics 
since Gibrat. This issue is exactly what the present paper addresses. 

For our purpose it is crucial to employ exhaustive lists of large firms. Our 
dataset for European countries is exhaustive in the sense that each list includes 
all the active firms in each country whose sizes exceed a certain threshold 
of observation. We show that both of the Pareto-Zipf law and Gibrat's law 
do hold for those large firms. As our main result, we prove that Pareto-Zipf 
law implies Gibrat's law and vice versa under detailed balance. By showing 
that the condition of detailed balance also holds in our empirical data, we 
can show the equivalence of Pareto-Zipf law and Gibrat's law as a kinematic 
principle in firms growth, irrespective of the underlying dynamics. Thereby, we 
conjecture that Gibrat's law does hold in the regime of Pareto-Zipf for large 
firms, but does not for smaller firms. Thus our result is not contradictory to the 
breakdown of Gibrat's law in previous study, most notably to the recent work 
by Stanley's group [27,28,29,30]. Furthermore, in the process of our proof, we 
also show that the distribution of growth rate possesses a non-trivial relation 
between the positive side (R > 1) of the distribution and the negative side 
(R < 1), through the value of Pareto index /z, which is confirmed empirically. 

In section 2, we give a brief review of the study on Gibrat's law and firm size 
distribution in economics. In section 3, we describe the nature of our database 
of firms with large size in European countries. In section 4, using exhaus- 
tive lists of large firms in the dataset, we show that Gibrat's law holds in the 
power-law regime for which the firm size distribution obeys Pareto-Zipf law. In 
addition, we uncover that temporal change of individual firm's size in succes- 
sive years satisfies what we call time-reversal symmetry, or detailed balance. In 
section 5, we prove that the two empirical laws of Gibrat and Pareto-Zipf are 
equivalent under the condition of detailed balance. We summarize our results 
in section 6. 



2 Gibrat and Pareto-Zipf Laws in Economics 

Industrial organization literature has long been focused on two empirical facts 
[3,6,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24] (see [25] for review): 

(i) skew distribution of firms size 

(ii) validity or invalidity of Gibrat's law for firm growth 

Gibrat formulated the law of proportionate effect for growth rate to explain 
the empirically observed distribution of firms. The law of proportionate effect 
states that the expected increment to a firm's size in each period is propor- 
tional to the current size of the firm. Let x t and x 4 _a< be, respectively, the 
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size of a firm at time t and t — At, and e t denote the proportionate rate of 
growth. The the postulate is expressed as 

x t - X t -At = e t x t -At- 

Gibrat assumed (a) that e t is independent of x t (Gibrat's law), (b) that e t has 
no temporal correlation, and (c) that there is no interaction between firms. 
Then, after a sufficiently long time t 3> At, since 

x t = x (l + ei)(l + e 2 ) • • • (1 + e t ), 

log x t follows a random walk. Assuming that e t is small, one has 
\ogx t = logx + e 1 + e 2 H h e t . 

Gibrat's model has two consequences concerning the above points (i) and (ii). 
Since the growth rate defined by Rt = x t /xo has its logarithm as the sum 
of independent variables et, the growth rate is log-normally distributed. In 
addition, assuming that all the firms have approximately the same starting 
time and size, the distribution of firms size is also also log-normal with mean 
and variance given by rat and a 2 t, respectively, where m is the mean of e t and 
a 2 is the variance of e t . 

The assumptions (a)-(c) in Gibrat's model are in disagreement with empiri- 
cal evidence. Among others, the Gibrat's law (a) is incompatible with the fact 
that the fluctuations of growth rate measured by standard deviation decreases 
as firm size increases [12,13,16,18,19]. Especially, the recent work [27,28] by 
Stanley's group showed that the distribution of the logarithm of growth rates, 
for each class of firms with approximately the same size, displays an expo- 
nential form (Laplace distribution) rather than log-normal. They also show 
that the fluctuations in the growth rates characterized by the standard devia- 
tion cr(x) of the distribution decreases for larger size of firms as a power-law, 
a(x) ~ x" 13 , with the exponent (5 is less than a half. The latter point suggests 
a new viewpoint about the interplay of different parts of a firm, an industrial 
sector, or an organization [29,30]. 

In contrast to the standard deviation, the measure by mean growth rate has 
been disputed. There were studies which showed that smaller firms grow faster 
[19] or slower [15,20] than bigger ones. However, it is generally thought that 
the proportional rate of growth of a firm (conditional on survival) is decreasing 
in size, as far as small and medium firms are concerned, which share a large 
fraction of industrial sectors in number. However, the remaining larger firms 
constitute a small fraction in number, but occupy a large fraction of total sum 
of firms size. This is due to the effect of heavy tail, much heavier than expected 
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from log-normal regime. See recent works [26,24] 4 . This is the Pareto-Zipf 
regime which we focus on in this paper. 

On the other hand, the assumption (b) about temporal correlation between 
successive growth rates are not investigated with definite conclusions. [17], 
for example, showed that the distribution of growth rates shows a first-order 
positive autocorrelation: the growth process will result faster for firms which 
recorded a sharp growth in previous years. ([17] also furnished a test test for 
the validity of Gibrat's Law that takes into account the "historical memory" 
of the growth process.) 

Gibrat's work also opened up a stream of theoretical models and ideas. Kalecki 
noted that Gibrat's model leads to "unrealistic" feature, that is, the variance 
of the size distribution would increase indefinitely with time. He considered 
several models, one of which assumed that the expected rate of growth in- 
creased less than proportionately, leading to a log-normal distribution with 
constant variance. 

Herbert Simon considered it more important that the firm size distribution has 
heavy tail in upper-region of size, which was better fitted by Yule distribution 
or asymptotically a Pareto-Zipf law. In order to explain such a distribution, 
based on his earlier work [11] for the explanation of Zipf's law in word fre- 
quency, he assumed Gibrat's law (in a much weaker form than ours) with a 
boundary condition for entry and exit of firms. In conformity with preceding 
work by Champernowne for personal income [10], Simon could show that the 
emergence of power-law behavior is quite robust irrespectively of modifica- 
tion of the stochastic process (see [3] for collection of related papers). Simon 
modeled the process of entry corresponding to new firms which compete with 
existing firms to catch market opportunities. This line of models was followed 
by [21] which relaxes the assumption of Gibrat's law, and also by [37] which 
explained the Laplace distribution for growth rate. Simon also extended his 
model incorporating merger and acquisition process (see also [22] for recent 
work). These works attempted to take into account the direct and indirect 
interactions among firms, which was ignored in the assumption (c) above. 

Our work is in affinity with Simon's view in the points that the upper-tail of 
size distribution, Pareto-Zipf law, is focused rather than the log-normal regime, 
and that the origin of it is related to Gibrat's law and boundary condition of 

4 [26] observed that log-normal distribution overestimates the upper-tail of size dis- 
tribution based on Computat in U.S. As noted in the paper, the dataset is consisting 
of only publicly-traded firms. This can be a possible cause of their observation. [24] 
used much larger dataset in U.K. Though their plot showed a power-law regime 
over several orders of magnitude, they rejected the hypothesis of power-law due to 
the presence of super-giant firms. We consider that both of the these points deserve 
further investigation. 



5 



entry-exit of firms. It is interesting to point out that Mansfield [14] , following 
Simon's model, empirically showed that the Gibrat's law seemed to hold only 
above a certain minimum size of firms. (See also [25] for the influence of [14] 
onto later work.) 

At the end of this brief survey, let us point out why recent advent of econo- 
physics can have important impact on economics. The econophysics approach 
attempts to treat the whole industrial organization as a complex system, in 
which firms are interacting atoms, that exhibits universal scaling laws [38]. 

Concerning firm size, the Pareto-Zipf power-law distribution has a long history 
since the seminal work by Herbert Simon, but its study extending to the de- 
tails of growth rate was only recently facilitated by modern datasets with good 
abundance and quality. In this line of research, resent findings (e.g. [34,36]) 
showed that power-law distribution gives a very good fit for different samples 
of firm size. In this paper we shall not only confirm this fact with different 
European countries and for different measures of size, but also uncover the 
underlying kinematics that relates Pareto-Zipf law to Gibrat law explicitly. 
Following the notion of self-organized criticality [39,40], the occurrence of a 
power-law reveals that a deep interaction among system's subunits, reacting 
to idiosyncratic shocks, leads to a critical state in which no attractive points 
nor states emerge. Such interaction and critical states are so important notions 
with that of self-organized criticality. Under economic point of view, interac- 
tion means that it is not possible to define a representative agent because the 
dynamics of the system is originated just from the interaction among hetero- 
geneous agents. Moreover, in consequence of critical state, equilibrium exists 
only as asymptote, along which the system moves from an unstable critical 
point to another. The authors believe that economics can enjoy these ideas 
coming from econophysics on heterogeneous interacting agents (see [41] [42] for 
an example). 



3 Dataset of European Firms 

We use the dataset, Bureau van Dijk's AMADEUS, which contains descriptive 
and balance data of about 260,000 firms of 45 European countries for the years 
1992-2001. For every firm are reported a number of juridical, historical and 
descriptive data (as e.g. year of inclusion, participations, mergers and acqui- 
sitions, names of the board directors, news, etc.) and a series of data drawn 
from its balance and normalized. It reports the current values (for several cur- 
rencies) of stocktaking, balance sheet (BS), profit and loss account (P/L) and 
ratios. The descriptive data are frequently updated while the numerical ones 
are taken from the last available balance. Since balance year does not always 
match conventional year, the number of firms included may vary during the 
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year if one of the excluded firms in last recording satisfy one of the criteria 
described below. The amount and the completeness of available data differs 
from country to country. To be included in the data set firms must satisfy at 
least one of these three dimensional criteria: 



• for U.K., France, Germany, Italy, Russian Federation and Ukraine 

• operating revenue equal to at least 15 million Euro 

• total assets equal to at least 30 million Euro 

• number of employees equal to at least 150 

• for the other countries 

• operating revenue equal to at least 10 million Euro 

• total assets equal to at least 20 million Euro 

• number of employees equal to at least 100 

As a proxy for firm size, we utilize one of the financial and fundamental vari- 
ables; total-assets, sales and number of employees. We use number of employ- 
ees as a complementary variable so as to check the validity and robustness 
of our results. Note that the dataset includes firms with smaller total-assets, 
simply because either the number of employees or the operating revenue (or 
both of them) exceeds the corresponding threshold. We thus focus on complete 
sets of those firms that have larger amount of total-assets than the threshold, 
and similarly those for number of employees. For sales, we assume that our 
dataset is nearly complete since a firm with a small amount of total-assets 
and a small number of employees is unlikely to make a large amount of sales. 
For our purposes, therefore, we discard all the data below each corresponding 
threshold for each measure of firm size. This procedure makes the number of 
data points much less. However, for a several developed countries, we have 
enough amount of data for the study of Gibrat's law. In what follows, our 
results are shown for UK and France, although we obtained similar results for 
other developed countries. The threshold for total-assets in these two coun- 
tries is 30 million euros, and that for number of employees is 150 persons, as 
described above. For sales, we used 15 million euros per year as a threshold. 
We will also show results for Italy and Spain in addition to U.K. and France 
only when examining the annual change of Pareto indices. 

It should be remarked that other problems in treating these data takes origin 
from the omission, in the on-line dataset of AMADEUS, of the date of upgrade, 
so that it is often not clear when a firm changed its juridical status, or went 
bankrupted or inactive. For some countries the indication activity /inactivity is 
not shown at all, so that it was impossible, even indirectly, to individuate the 
year of exit. Therefore, our study should be taken as the analysis conditional 
on survival of firms. 
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4 Firm Growth 



In this section, our results are shown for UK and France, and for total-assets, 
number of employees and sales. Each list of firms is exhaustive in the way we 
described in the preceding section. 

4-1 P areto- Zip f distribution 

First we show that the distribution of firm size obeys a power-law in the range 
of our observation whatever we take as a variable for firm size. Fig. 1 depicts 
the cumulative distributions for total-assets in France (a), sales in France (b), 
and number of employees in UK (c). The number of data points is respectively 
(a) 8313, (b) 15776 and (c) 15055. 

Pareto-Zipf law states that the cumulative distribution P>(x) for firm size x 
follows eq. (1). The power-law fit for x > Xq, where Xq denotes the thresh- 
old mentioned above for each measure of firm size, gives the values of /x; (a) 
0.886±0.005, (b) 0.896±0.011, (c) 0.995±0.013 (standard error at 99% signifi- 
cance level) . /x is close to unity. Note that the power- law fit is quite well nearly 
three orders of magnitude in size of firms. 

Pareto index is surprising stable in its value. Fig. 2 is a panel for the annual 
change of Pareto indices for four countries, Italy, Spain, France and U.K. 
estimated from total- assets, number of employees and sales (except U.K.). 
Different measures of firm size give reasonably same behavior. It is observed 
that the value /x is quite stable being close to unity in all the countries. 

4-2 Gibrat's law 

Let us denote a firm's size by x and its two values at two successive points 
in time (i.e., two consecutive years) by x\ and x 2 - Growth rate is given by 
R = x 2 /xi. We also express the rate in terms of its logarithm, r = log 10 R. We 
examine the probability density for the growth rate P(r\xi) on the condition 
that the firm size x\ in an initial year is fixed. 

For the conditioning we divide the range of X\ into logarithmically equal bins. 
For the total-assets in the dataset (Fig. 3 (a)), the bins are taken as X\ G 3 x 
[ 10 7+o.4(n-i) ; 10 7+o.4nj ( euros ) with n = 1, • • • , 5. For the sales in (b), x x G 1.5 x 
^g7+o.4(n-i)^ ^g7+o.4nj ( euros ) w ith n — 1, • • • , 5. For the number of employees 
in (c), X! E 1.5 x [io 2 +°- 4 («-i) ) io 2 +o- 4 ™] (persons) with n = 1, • • • , 5. In all 
the cases, the range of conditioning covers two orders of magnitude in each 
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variable. We calculated the probability density function for r for each bin, and 
checked the statistical dependence on x\ by graphical method. 

Fig. 3 is the probability density function P(r\xi) for each case. It should be 
noted that due to the limit X\ > x and x 2 > x , the data for large negative 
growth are not available. In all the cases, it is obvious that the function P(r\x\) 
has little statistical dependence on x\, since all the curves for different n 
collapse on a single curve. This means that the growth rate is independent of 
firm size in the initial year. That is, Gibrat's law holds. 

4-3 Time-reversal symmetry 

The validity of Gibrat's law in the Pareto-Zipf regime appears to be in dis- 
agreement with recent literature on firm growth. In the next section, we will 
show that this is not actually the case by proving that Gibrat and Pareto- 
Zipf are equivalent under an assumption. The assumption is detailed balance, 
whose validity is checked here. 

Let us denote the joint probability distribution function for the variable x\ 
and X2 by Pi 2 (xi, x 2 ). The detailed balance, or what we call time-reversal sym- 
metry, is the assumption that Pi 2 (x±, x 2 ) = P\i{x 2 , x i)- The joint probabilities 
for our datasets are depicted in Fig. 4 as scatter-plots of individual firms. 

We used two different methods to check the validity of time-reversal symmetry. 
One is an indirect way to check a non-trivial relationship between the growth- 
rate in positive side (r > 0) and that in negative (r < 0). That is, as we shall 
prove in the next section, the probability density distribution in positive and 
negative growth rates must satisfy the relation given by eq. (27), if the property 
of time-reversal symmetry holds. We fitted the cumulative distribution only for 
positive growth rate by a non-linear function, converted to density function, 
and predicted the form of distribution for negative growth rate by eq. (27) so 
as to compare with the actual observation (see Appendix for details). In each 
plot of Fig. 3, a solid line in the r > side is such a fit, and a broken line in 
the r < side is our prediction. The agreement with the actual observation is 
quite satisfactory, thereby supporting the validity of time-reversal symmetry. 

The other way we took is a direct statistical test for the symmetry in the two 
arguments of P\ 2 (x\, x 2 ). This can be done by two-dimensional Kolmogorov- 
Smirnov (K-S) test, which is not widely known but was developed by astro- 
physicists [43,44,45]. This statistical test is not strictly non-parametric (like 
the well-known one-dimensional K-S test), but has little dependence on parent 
distribution except through coefficient of correlation. We compare the scatter- 
plot sample for Pi 2 (xi, x 2 ) with another sample for x\ and x 2 interchanged 
by making the null hypothesis that these two samples are taken from a same 
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parent distribution. We used the logarithms £1 = logxi and £2 = logX2, and 
added constants to ^ and £2 so that the average growth rate is zero. This 
addition (or multiplication in x\ and x 2 ) is simply subtracting the nominal 
effects due to inflation, etc. We applied two-dimensional K-S test to the re- 
sulting samples. The null hypothesis is not rejected in 95% significance level 
in all the cases we studied. 



5 Pareto-Zipf 's law and Gibrat's law under detailed balance 

In the preceding section, we have shown that both of Pareto-Zipf and Gibrat's 
laws hold for large firms. This suggests that these two laws are closely related 
with each other. We show in this section that in fact they are equivalent to 
each other under the condition of detailed balance. 

Let 1 be a firm's size, and let its two values at two successive points in time 
(i.e., two consecutive years) be denoted by x\ and x 2 . We denote the joint 
probability distribution function (pdf) for the variable x\ and x 2 by Pi 2 (x 1 , x 2 ). 
The joint pdf of x\ and the growth rate R = x 2 jx\ is denoted by P\r(x\, R). 
Since Pi 2 (xi, x 2 )dx\dx 2 = Pir(x\, R)dx\dR under the change of variables from 
(x\,x 2 ) to (xi,R), these two pdf's are related to each other as follows: 




(3) 



We define conditional probabilities: 



P m (x 1 ,R) = P 1 (x 1 )Q(R\x l ) 
= P R (R)S( Xl \R), 



(4) 
(5) 



Both P\(xi) and Pr(R) are marginal: 





00 




(7) 







since the following normalizability conditions are satisfied: 



oo 




(8) 
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oo 

1 = J S(x 1 \R)dx 1 . 





(9) 



Three phenomenological properties can be summarized as follows. 



(A) 



Detailed Balance (Time-reversal symmetry): 
The joint pdf ^12(^1,^2) is a symmetric function: 



^12(^1,^2) = ^12(^2,^1)- 



(10) 



(B) 



Pareto-Zipf's law: 

The pdf P\(x) obeys power-law for large x: 



P\{x) oc x 



-M-i 



(11) 



(C) 



for x — > 00 with /i > 0. 
Gibrat's law: 

The conditional probability | x) is independent of x: 



g( J R|x) = Q( J R). 



(12) 



We note here that this holds only for large x, because we confirmed it 
in actual data only in that region, and because otherwise it leads to an 
inconsistency as we will see shortly. This relation was called Universality 
in [7,8,46,47]. All the arguments below is restricted in this region. 

Before starting our discussion of interrelation between these properties, let us 
first rewrite the detailed balance condition (A) in terms of Pir(x±, R): 

P 1R (x 1 , R) = x 1 P 12 (x 1 ,x 2 ) 
= x 1 P 12 (x 2 ,xi) 

= —X 2 Pl2{x 2 ,X l ) 



where eq. (10) was used in the second line, and eq. (3) was used in the first 
and the third line. The above relation may be rewritten as follows by the use 
of the conditional probability Q(R\xi) in eq. (5); 




(13) 



QCR- 1 !^) 
Q{R\x 1 ) 



= R 



Pijxi) 
Pi(x 2 y 



(14) 



In passing, it should be noted that eq. (13) leads to the following: 
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P R (R) = J P 1R {x 1 ,R)dx l 



CO 

= J R^Pw (x 2 ,R- 1 ) dX! 



CO 

= J R- 2 P 1R (x 2 ,R~ 1 ) dx 2 



= R- 2 P R (R- 1 ) (15) 

where eq. (13) was used in the second line, and the third line is merely change 
of integration variable. This relation between the marginal growth-rate pdf 
Pr(R) for positive growth (R > 1) and negative growth (R < 1) leads to the 
following relation, as it should: 

CO 1 

J P R (R)dR = J P R (R)dR. (16) 
1 o 

5.1 (A)-h(CMB) 

Let us first prove that the properties (A) and (C) lead to (B). By substituting 
the Gibrat's law eq. (12) in eq. (14), we find the following: 

Pi(xi) _ 1 QjR- 1 ) 

Pi(x2) R Q(R) ' 1 ' 

This relation can be satisfied only by a power-law function eq. (11). 
[Proof] 

Let us rewrite eq. (17) as follows: 

P 1 (x) = G(R)P 1 (Rx), (18) 



where x denotes x\, and G(R) denotes the right-hand side of eq. (17), i.e. 



We expand this equation around R = 1 by denoting R = 1 + e with e< 1 as 
Pi(a;) = G(l + e)Pi((l + e)a;) 



12 



= (1 + G'(l)e + • • -)(Pi(x) + P[{x)ex + • • •) 
= P^x) + e(G"(l)P 1 (x) + xP;(x)) + 0(e 2 ), 



(20) 



where we used the fact that G(l) = 1. We also assumed that the derivatives 
G"(l) and P{(x) exists in the above, whose validity should be checked against 
the results. From the above, we find that the following should be satisfied 

G'(l)P 1 (x)+xPi{x) = 0, (21) 



whose solution is given by 

P 1 (x) = Cx- G 'V>. (22) 



This is the desired result, Pareto-Zipf's law, and is consistent with the as- 
sumption made earlier that P[(x) exists. By substituting the result eq. (22) 
in eq. (19) and eq. (17), we find that 

G(R) = R G ' {1) , (23) 



which is consistent with the assumption that G'(l) exists. 

[Q.E.D.] 

From eq. (19) we may calculate G'(l) in terms of derivatives of Q(R). It should, 
however, be noted that Q(R) has a cusp at R = 1 as is apparent in Fig. 3, 
and therefore Q'(R) is expected not to be continuous at R — 1. Bearing this 
in mind, we calculate G{\ + e) for < e <C 1 as follows: 



V ; 1 + e Q(l + e) 

~ [ ' Q(i) + eQ + '(i) 

where we denoted the right-derivative and left- derivative of Q(R) at R — 1 by 
the signs + and — in the superscript, respectively. From the above, we find 
that 

g( i ) = -i- gflHp) , (25) 
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From eq. (22) and eq. (25), we find that 



Q + '(l) + Q-'(l) 



= -fi-2. 



(26) 



From eqs.(19) and (23), we find the following relation: 



Q{R) = R~^ 2 



QiR- 1 ), 



(27) 



which should be in contrast to eq. (15). This is related to the point that we 
mentioned in eq. (12): If the Gibrat's law eq. (12) holds for all x € [0,oo], 
then Pr(R) = Q{R) from eq. (7). If so, eq. (27) contradicts to eq. (15) since 
H > 0. Besides, the Pareto-Zipf 's law we derived from Gibrat's law is not 
normalizable if it holds for any x. Therefore, Gibrat's law should hold only for 
large x. 

The result eq. (27) shows that the function Q(R) is continuous at it! = 1, as 
is easily seen by substituting R — 1 + e with e > on both hand side and 
taking the limit e — > +0. Also, by taking the derivative of the both hand side 
and taking the limit in a similar manner, we can reproduce eq. (26). 

5.2 (A) + (B)^ ? 

Let us next examine what we obtain if we had only Pareto-Zipf's law instead 
of Gibrat's law under the detailed balance. 

In this case, substituting the Pareto-Zipf's law eq. (11) into eq. (14) we find 
that 



QiR- 1 


\Rx) 


Q(R\ 


x) 



where we denote X\ by x and x 2 by Rx. We now define a function H(z,x) as 



It should be noted that this does not constrain Q{R\x) in any way: arbitrary 
function of the variable R and x can be written in the form of eq. (29). By 
substituting eq. (29) into eq. (28), we find that 



Q(R\x) = x tx+2 H(R 1 ' 2 x,x). 



(29) 



H(R 1/2 x,Rx) = H{R 1/2 x,x), 



(30) 
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which means that the function H(z,x) has the following invariance property. 
H(z,x) = H(z,z 2 /x). (31) 



Other than this constraint and some trivial constraint such as continuity, there 
is no nontrivial constraint on H(z,x) or Q(R \ x). 

The results eqs. (29) and (31) is a generalization of the property eq. (27) we 
found earlier [7]. In fact, the property eq. (27) follows from eq. (31) in the 
special case: 

H(z,x) = Q((z/x) 2 )x-»- 2 , (32) 
for which eq. (29) becomes eq. (12), namely the statement of Gibrat's law. 

5.3 (B) + (C)^(A)? 

Let us discuss the last question: Under Pareto's and Gibrat's laws, what can 
we say about the detailed balance? In order to answer this, we use eq. (11) 
and eq. (12) to write Pir(x, R) for large x as follows: 

P 1R (x,R)=Ax-»- 1 Q(R), (33) 

where A is a proportionality constant. According to eq. (13), the detailed 
balance is satisfied if this is equal to 

R-'PmixR, R- 1 ) = Ax'^R-^QiR- 1 ), (34) 

where we used eq. (33). Therefore, we find that the detailed balance condition 
is equivalent to eq. (27) in this case. 

Summarizing this section, we have proved that under the condition of detailed 
balance (A), if the Pareto-Zipf law (B) holds in a region of firm size, then the 
Gibrat's law (C) must hold in the region, and vice versa. The condition (A) 
means detailed-balance. On the other hand, if both of (B) and (C) hold, (A) 
follows provided that eq. (27) holds, eq. (27) is our prediction which gives a 
non-trivial relation between positive growth (R > 1) and negative (R < 1). 
This kinematic relation was empirically verified in Fig. 3. See also previous 
work [7,8,46,47] for the validity of this relation in personal income and firms 
tax-income in Japan. 
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6 Summary 



The distribution of firm size is quite often dominated by power-law in the 
upper tail over several orders of magnitude. This regime of Pareto-Zipf law 
is different from log-normal distribution in the lower and sometimes wider 
regime of firm size. The upper tail is occupied by a small number of firms, but 
they dominate a large fraction of total sum of firm size. 

By using exhaustive datasets of those large firms and with different measures 
of firm size in Europe, we show that the Pareto-Zipf law holds as in eq. (1) 
for firm size x larger than observational threshold xq, and that Gibrat's law of 
proportionate effect holds as in eq. (2) for successive sizes X\ and x 2 exceeding 
x , stating that the growth rate of each firm is independent of initial size. 
We also find that detailed balance holds which means that the frequency of 
transition from x\ to x 2 is statistically the same as that for its reverse process. 
The Gibrat's law, Pareto-Zipf's law and detailed balance condition are related 
to each other. We prove various relationships among them. It follows as one of 
the consequences that there exists a relation between the positive and negative 
sides of the distribution of growth rate via the Pareto index. The relation is 
confirmed empirically in our dataset of European firms. 
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x (person) 

Fig. 1. Cumulative probability distribution P>{x) for firm size x. (a) Total-assets 
in France (2001) greater than 30 million euros, (b) sales in France (2001) greater 
than 15 million euros, (c) number of employees in UK (2001) larger than 150 per- 
sons. Lines are power-law fits with Pareto indices, (a) 0.886, (b) 0.896, (c) 0.995 
(least-square- fit in logarithmic scale). 
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Fig. 2. Annual change of Pareto indices for Italy, Spain, France and U.K. from 1993 
to 2001 for total-assets, number of employees, and sales (except U.K.). The estimate 
of Pareto index in each year was done by extracting a range of distribution corre- 
sponding to large-size firms, which is common to different countries but different for 
different measure of size, and by least-square-fit in logarithmic scales of rank and 
size. 
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Fig. 3. Probability density P(r\x\) of growth rate r = log 10 (x2/xi) for the two 
years, 2000/2001. The datasets in (a)-(c) are the same as in eq. (1). Different bins 
of initial firm size with equal magnitude in logarithmic scale were taken over two 
orders of magnitude as described in the main text. The solid line in the portion of 
positive growth (r > 0) is a non-linear fit. The dashed line (r < 0) in the negative 
side is calculated from the fit by the relation given in the equation eq. (27). 
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( a ) France total assets (2001/2000) 




x 1 (euro) 
(b) France sales (2001/2000) 




10 2 10 3 10 4 10 5 
x-| (person) 

Fig. 4. Scatter-plot of all firms whose size exceeds a threshold. The datasets in 

(a) -(c) are the same as in eq. (1). Thresholds are (a) 30 million euros for total-assets, 

(b) 15 million euros for sales, and (c) 150 persons for number of employees. The 
number of such large firms is respectively (a) 6969, (b) 13099 and (c) 12716. 
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A Fitting distribution of growth rate 



For the purpose of fitting probability density function of positive growth rate 
(R > 1), we used cumulative distribution of positive growth rate, defined by 

P+(R) = Prob{# > R\R > 1}. 



P>{R) can be estimated, as usual, by size versus rank plot restricted only for 
R > 1 as follows. Let the number of all firms with R > 1 be N+, and sort their 
growth rates in descending order: R.W > R^ > ■■■ > R {k) > ■■■ > R( N +\ 
Then the estimate is given by 



P?(R) = ^~ = C- 1 J P 1R (x 1 ,R)dx 1 dR, 



where = {(x 1 , R)\x 1 > x ,R > R^} (x is the observational threshold 
mentioned in section 4.1), and C is the normalization: 



oo oo 



C = j d Xl j dRP 1R { Xl ,R). 



XQ 



Using the observational fact that eq. (12) holds in the region {(x 1 , R)\xi > 
Xq,R> 1}, the above equation for (R) reads 

oo oo oo 

p+(R) = C- 1 J dx 1 P l {x l )dx l J dR'Q(R') = Qq- 1 j Q(R')dR', (A.l) 

xo R R 

where the normalization factor is written by 

oo 

Qo = J Q(R')dR'. (A.2) 
i 

By taking derivative of eq. (A.l) with respect to R, it follows that 



Q(R) = -Qo-^ p >(R)- ( A -3) 



We empirically found that the rank-size plot can be well fitted by a non-linear 
function of the form: 



log 10 P+(R = 10 r ) = -a(l - e- br ) -cr = F(r), (A.4) 
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Fig. A.l. Cumulative probability P> (R = 10 r ) for the growth of total-assets in 
France (2001/2000). n is the index of bin used in Fig. 3 (a), "all" means the plot 
for all the dataset of positive r. "Fit" is done by the non-linear function given by 
the equation eq. (A. 4). 

where a, b and c are parameters. An example is given in Fig. A.l for France 
total-assets (2001/2000). Cumulative probabilities P£(r\xi) (the left-hand 
side of eq. (A. 4)) conditioned on an initial year's total-assets are shown for 
each of the same bins used in Fig. 3 (a), but restricted to the data with posi- 
tive r. The non-linear fit done by eq. (A. 4) is represented by a solid and bold 
line in the figure. Note also that the curves for different bins almost collapse 
because of the statistical independence of aq. 

Under the change of variable, r = log 10 R, the probability density for r defined 
by q(r) is related to that for R by 



logic = log 10 Q{R = W) + r + log 10 (ln 10) 



(A.5) 



Therefore it follows from eq. (A. 3) and eq. (A.5) that 

dF(r ~ 



logi ?M = F{r) + log 10 



dr 



+ log 10 Q + log 10 (lnl0). 



(A.6) 



In each plot of Fig. 3, the solid curve is given by eq. (A.6), where P(r\x\) 
denotes the probability density function q(r) for r, conditioned on an initial 
year's size x±. 

The relation eq. (27) for positive (R > 1) and negative (R < 1) growth rates 
can be written in terms of q(r) as 

i°gio ?( r ) = + lo Sio q(- r ), ( A - 7 ) 
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which is easily shown by eq. (A. 5). In each plot of Fig. 3, the dotted curve for 
negative growth rate (r < 0) is obtained from the solid curve for positive one 
(r > 0) through the relation eq. (A. 7). 
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